Guidage: a Fast audio Query Guided assemblage
نویسندگان
چکیده
In this article, a method is proposed for fast and automatic retrieval of factors of audio content in a large audio database based on user’s audio query. The proposed method, unlike most existing systems, takes explicit considerations of temporal morphology of audio content. This work touches upon several existing approaches and technologies for sound manipulations, such as sound texture synthesis, music and audio mosaicing on the synthesis side, and audio matching, query by audio and audio structure discovery on the analysis side. Destined for creative applications, the proposed method is modular by allowing interactive choice of search criteria. The analysis side of the proposed model features a new audio structure discovery algorithm called Audio Oracle that describes the temporal morphologies of the underlying sound as a compact state-space model. The search engine, and the main focus of this paper, features a fast and novel algorithm based on dynamic programming called Guidage that is capable of reassembling the query audio by concatenating subclips of target audio files. Demonstrated results suggest a degree of semantic-driven control for query guided applications. The article concludes with examples of two immediate applications of audio matching using Guidage on music, speech and natural sounds and a discussion on further development and use of such methods in interactive and creative environments.
منابع مشابه
Prototyping a Vibrato-Aware Query-By-Humming (QBH) Music Information Retrieval System for Mobile Communication Devices: Case of Chromatic Harmonica
Background and Aim: The current research aims at prototyping query-by-humming music information retrieval systems for smart phones. Methods: This multi-method research follows simulation technique from mixed models of the operations research methodology, and the documentary research method, simultaneously. Two chromatic harmonica albums comprised the research population. To achieve the purpose ...
متن کاملBiopsies prostatiques sous guidage échographique 3D et temps réel (4D) sur fantôme. Etude comparative versus guidage 2D
Conclusion : La méthode de biopsies de prostate par guidage échographique 3D temps-réel semble montrer sur modèle synthétique une amélioration dans la précision localisatrice et dans la faculté à reproduire un protocole. La répartition des biopsies ne semble pas améliorée.
متن کاملFast Hamming Space Search for Audio Fingerprinting Systems
In music information retrieval, a huge search space has to be explored because a query audio clip can start at any position of any music in the database, and also a query is often corrupted by significant noise and distortion. Audio fingerprints have recently attracted much attention in music information retrieval, for they provide a compact representation of the perceptually relevant parts of ...
متن کاملFast vocabulary-independent audio search using path-based graph indexing
Classical audio retrieval techniques consist in transcribing audio documents using a large vocabulary speech recognition system and indexing the resulting transcripts. However, queries that are not part of the recognizer’s vocabulary or have a large probability of getting misrecognized can significantly impair the performance of the retrieval system. Instead, we propose a fast vocabulary indepe...
متن کاملUse of GPU and Feature Reduction for Fast Query-by-Example Spoken Term Detection
For query-by-example spoken term detection (QbE-STD) on low resource languages, variants of dynamic time warping techniques (DTW) are used. However, DTW-based techniques are slow and thus a limitation to search in large spoken audio databases. In order to enable fast search in large databases, we exploit the use of intensive parallel computations of the graphical processing units (GPUs). In thi...
متن کامل